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Abstract (Basic) : US 20040003005 Al 

NOVELTY - The method involves providing multiple data records in 
one or more tables containing respective fields. The possible 
duplicate data records within the tables, are identified by measuring 
a co-occurrence of data in hierarchically related fields of the 
table . 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(1) system for evaluating duplicate records in database ; and 

(2) machine readable medium storing duplicate data record 
identification program. 

USE - For detecting duplicate data records in database of data 
processing system such as personal computer and server connected to 
network such as internet, intranet, local area network and wide area 
network . 

ADVANTAGE - The duplicate data records are accurately identified, 
using simple technique. 

DESCRIPTION OF DRAWING (S) - The figure shows a flowchart explaining 
the duplicate data record identification method. 
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Abstract (Basic) : US 20030167253 Al 

NOVELTY - The method involves adding a designated record to the 
potential family of records when it is automatically determined that 
the designated record is not a duplicate of the records in the 
family - An indicator is automatically set in each of the data records 
in the 'potential family of records to indicate a family 
relationship between the records . 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(a) a computer-readable memory medium 

(b) a record management system. 

USE - Used for identifying relationship among data records. 
ADVANTAGE - The method automatically determines missing information 
from a particular data record and also deletes duplicate records. 

DESCRIPTION OF DRAWING (S) - The drawing shows an example flowchart 
of a routine for importing and automatically de- duplicating a file of 
new data records . 
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Abstract (Basic) : EP 1103911 Al 

NOVELTY - Automated method (A) for identifying related biomolecular 
sequences having defined features of interest from databases , is new. 
The databases comprise at least a first and a second set of sequences 
and each set is derived from a different type of organism. 

DETAILED DESCRIPTION - Automated method (A) for identifying related 
biomolecular sequences having defined features of interest from 
databases comprises : 

(1) establishing from the first set of sequences a non-redundant 
list of query sequences having the defined features of interest (first 
family members) , using a database search program; 

(2) performing sequence alignments with the first family members 
in a second set of sequences derived from a second type of organism, 
using a database search program and a preset similarity threshold, 
giving a list of second family members; 

(3) establishing a two dimensional matrix displaying the first and 
second family members and their respective similarity values 
resulting from step (2), optionally displaying only those second 
family members having similarity values exceeding a preset threshold 
value; 

(4) selecting from the matrix those pairs of first and second 
family members for which the similarity values are the best among all 
of the alignments that involve one of the two pair's members 
(orthologs ) . 

USE - The method is used for identifying related biomolecular 
sequences having defined features of interest from databases of 
nucleic acid and amino acid sequences. The method can identify related 
or homologous sequences which can then be analyzed further in a 
laboratory environment. The features of interest can be specific 
classes of proteins or protein domains which can then serve as drug 
targets and are of high interest to researchers aimed at finding new 
drugs . 

ADVANTAGE - The method is a high speed process for identifying the 
best-matching pair of orthologs. The basic parameters defining the 
sequences of interest and threshold values for the database searches 
can be set up to suit the needs of the researcher. Data can be analyzed 
off-line which saves time and costs. 
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Abstract (Basic) : US 6128626 A 

NOVELTY - A product directory index and data table structures for 
storing number of respective index and data entries are stored in 
tables (200-1-200-4) in database (200) . A set of data retrieval 
components are provided in database for accessing index value and 
data entries and database is accessed via multiple client system to 
generate bill of material document for board assembly products. 

DETAILED DESCRIPTION - The product directory index table structure 
storing index value entries coded in preset manner representing all 
board assembly products being currently manufactured is stored in a 
first table in database . Each index value entry contains key values 
having file key value coded for extracting data entries from other 
tables contained in database including information related to 
particular board assembly product and file version value coded for 
designating points within the database table where entries are added 
or deleted. Data table structures storing data entries containing 
different assembly board product related information used for 
generating bill of materials (BOM) documents used in manufacturing 
board assembly products is stored in other tables in the database . 
Each entry contains key values defining data selection criteria used in 
extracting entries utilized by particular board assembly product. The 
key values in each data entry includes ON and OFF field values which 
are set for identifying version of board products with and without data 
entry. The ON and OFF field values are used along with file version 
value for defining data selection criteria. The database is accessed 
via multiple client system having copy of data retrieval components to 
generate BOM document for board assembly products. An INDEPENDENT CLAIM 
is also included for software database system. 

USE - For organizing database for minimizing storage amount of 
duplicate information used for generating bill of material documents 
for manufacturing printed circuit board assembly products. 

ADVANTAGE - Enables operator or user to access database for 
obtaining information pertaining to particular PCB assembly. 

DESCRIPTION OF DRAWING (S) - The figure shows components of 
database data retrieval software. 
Database (200) 

Tables (200-1-200-4) 
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